Search for: All records

Creators/Authors contains: "Sha, Yutong"


  1. LMNA-related dilated cardiomyopathy (DCM) is an autosomal-dominant genetic condition with cardiomyocyte and conduction system dysfunction often resulting in heart failure or sudden death. The condition is caused by mutation in the Lamin A/C (LMNA) gene encoding Type-A nuclear lamin proteins involved in nuclear integrity, epigenetic regulation of gene expression, and differentiation. The molecular mechanisms of the disease are not completely understood, and there are no definitive treatments to reverse progression or prevent mortality. We investigated possible mechanisms of LMNA-related DCM using induced pluripotent stem cells derived from a family with a heterozygous LMNA c.357-2A>G splice-site mutation. We differentiated one LMNA-mutant iPSC line derived from an affected female (Patient) and two non-mutant iPSC lines derived from her unaffected sister (Control) and conducted single-cell RNA sequencing for 12 samples (four from Patients and eight from Controls) across seven time points: Day 0, 2, 4, 9, 16, 19, and 30. Our bioinformatics workflow identified 125,554 cells in raw data and 110,521 (88%) high-quality cells in sequentially processed data. Unsupervised clustering, cell annotation, and trajectory inference found complex heterogeneity: ten main cell types; many possible subtypes; and lineage bifurcation for cardiac progenitors to cardiomyocytes (CMs) and epicardium-derived cells (EPDCs). Data integration and comparative analyses of Patient and Control cells found cell type and lineage-specific differentially expressed genes (DEGs) with enrichment, supporting pathway dysregulation. Top DEGs and enriched pathways included 10 ZNF genes and RNA polymerase II transcription in pluripotent cells (PP); BMP4 and TGF Beta/BMP signaling, sarcomere gene subsets and cardiogenesis, CDH2 and EMT in CMs; LMNA and epigenetic regulation, as well as DDIT4 and mTORC1 signaling in EPDCs. 
Top DEGs also included XIST and other X-linked genes, six imprinted genes (SNRPN, PWAR6, NDN, PEG10, MEG3, MEG8), and enriched gene sets related to metabolism, proliferation, and homeostasis. We confirmed Lamin A/C haploinsufficiency by allelic expression and Western blot. Our complex Patient-derived iPSC model for Lamin A/C haploinsufficiency in PP, CM, and EPDC provided support for dysregulation of genes and pathways, many previously associated with Lamin A/C defects, such as epigenetic gene expression, signaling, and differentiation. Our findings support disruption of epigenomic developmental programs, as proposed in other LMNA disease models. We recognized other factors influencing epigenetics and differentiation; thus, our approach needs improvement to further investigate this mechanism in an iPSC-derived model. 
  2. Time-series single-cell RNA sequencing (scRNA-seq) datasets provide unprecedented opportunities to learn dynamic processes of cellular systems. Due to the destructive nature of sequencing, it remains challenging to link the scRNA-seq snapshots sampled at different time points. Here we present TIGON, a dynamic, unbalanced optimal transport algorithm that reconstructs dynamic trajectories and population growth simultaneously as well as the underlying gene regulatory network from multiple snapshots. To tackle the high-dimensional optimal transport problem, we introduce a deep learning method using a dimensionless formulation based on the Wasserstein–Fisher–Rao (WFR) distance. TIGON is evaluated on simulated data and compared with existing methods for its robustness and accuracy in predicting cell state transition and cell population growth. Using three scRNA-seq datasets, we show the importance of growth in the temporal inference, TIGON’s capability in reconstructing gene expression at unmeasured time points and its applications to temporal gene regulatory networks and cell–cell communication inference.
  3. In biology, cell-fate decisions are controlled by complex gene regulation. Although gene expression data may be collected at multiple time points, it remains difficult to construct the continuous dynamics from the data. In this work, we developed a data-driven approach, NeuralGene, a model based on neural ordinary differential equations (ODEs), to reconstruct continuous dynamical systems governing gene regulation from temporal gene expression data. In addition, NeuralGene has the flexibility of incorporating partial prior biological information in the model to further improve its accuracy. For a given cell at a static time point, the NeuralGene model can impute its continuous gene expression dynamics and predict its cell fate. We applied NeuralGene to a simulated toggle-switch model to verify its utility in modeling and reconstructing temporal dynamics. In addition, NeuralGene was applied to experimental single-cell qPCR data to show its ability for gene expression imputation and cell-fate prediction.
  4. Epithelial-to-mesenchymal transition (EMT) plays an important role in many biological processes during development and cancer. The advent of single-cell transcriptome sequencing techniques allows the dissection of dynamical details underlying EMT with unprecedented resolution. Despite several single-cell data analyses of EMT, how cells communicate and regulate dynamics along the EMT trajectory remains elusive. Using single-cell transcriptomic datasets, here we infer the cell–cell communications and the multilayer gene–gene regulation networks to analyze and visualize the complex cellular crosstalk and the underlying gene regulatory dynamics along EMT. Combined with trajectory analysis, our approach reveals the existence of multiple intermediate cell states (ICSs) with hybrid epithelial and mesenchymal features. Analyses of the time-series datasets from cancer cell lines with different inducing factors show that the induced EMTs are context-specific: the EMT induced by transforming growth factor B1 (TGFB1) is synchronous, whereas the EMTs induced by epidermal growth factor and tumor necrosis factor are asynchronous, and the responses of the TGF-β pathway in terms of gene expression regulation are heterogeneous under different treatments or among various cell states. Meanwhile, network topology analysis suggests that the ICSs during EMT serve as the signaling in cellular communication under different conditions. Interestingly, our analysis of a mouse skin squamous cell carcinoma dataset also suggests that, regardless of the significant discrepancy in concrete genes between in vitro and in vivo EMT systems, the ICSs play a dominant role in the TGF-β signaling crosstalk. Overall, our approach reveals the multiscale mechanisms coupling cell–cell communications and gene–gene regulations responsible for complex cell-state transitions.
  5. Single-cell analysis of human basal cell carcinoma shows a WNT5A reactive stroma that promotes tumor HSP70 to maintain growth. 
  6. Rapid growth of single-cell transcriptomic data provides unprecedented opportunities for close scrutiny of dynamical cellular processes. Through investigating epithelial-to-mesenchymal transition (EMT), we develop an integrative tool that combines unsupervised learning of single-cell transcriptomic data and multiscale mathematical modeling to analyze transitions during cell fate decision. Our approach allows identification of individual cells making transitions between all cell states, and inference of genes that drive transitions. Multiscale extractions of single-cell scale outputs naturally reveal intermediate cell states (ICS) and ICS-regulated transition trajectories, producing emergent population-scale models to be explored for design principles. Testing on the newly designed single-cell gene regulatory network model and applying it to twelve published single-cell EMT datasets in cancer and embryogenesis, we uncover the roles of ICS in adaptation, noise attenuation, and transition efficiency in EMT, and reveal their trade-off relations. Overall, our unsupervised learning method is applicable to general single-cell transcriptomic datasets, and our integrative approach at single-cell resolution may be adopted for other cell fate transition systems beyond EMT.
  7. Genetic mutations to the Lamin A/C gene (LMNA) can cause heart disease, but the mechanisms making cardiac tissues uniquely vulnerable to the mutations remain largely unknown. Further, patients with LMNA mutations have highly variable presentation of heart disease progression and type. In vitro patient-specific experiments could provide a powerful platform for studying this phenomenon, but the use of induced pluripotent stem cell-derived cardiomyocytes (iPSC-CM) introduces heterogeneity in maturity and function, thus complicating the interpretation of the results of any single experiment. We hypothesized that integrating single cell RNA sequencing (scRNA-seq) with analysis of the tissue architecture and contractile function would elucidate some of the probable mechanisms. To test this, we investigated five iPSC-CM lines, three controls and two patients with a (c.357-2A>G) mutation. The patient iPSC-CM tissues had significantly weaker stress generation potential than control iPSC-CM tissues, demonstrating the viability of our in vitro approach. Through scRNA-seq, differentially expressed genes between control and patient lines were identified. Some of these genes, linked to quantitative structural and functional changes, were cardiac specific, explaining the targeted nature of the disease progression seen in patients. The results of this work demonstrate the utility of combining in vitro tools in exploring heart disease mechanics.
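Record 3 mentions verifying NeuralGene on a simulated toggle-switch model. The abstract does not give that model's equations, so the sketch below is only an illustration of what such ground-truth data could come from. It assumes the textbook two-gene mutual-repression form with Hill kinetics. The parameter values (`alpha=4.0`, `hill_n=2.0`), the function names, and the fixed-step Runge-Kutta integrator are all hypothetical choices, and none of this is the NeuralGene method itself.

```python
# Illustrative two-gene toggle switch (assumed form, not taken from the record):
#   dx/dt = alpha / (1 + y**n) - x
#   dy/dt = alpha / (1 + x**n) - y
# Each gene represses the other, so the system settles into one of two fates.


def toggle_switch_rhs(state, alpha=4.0, hill_n=2.0):
    """Right-hand side of the assumed mutual-repression ODE system."""
    x, y = state
    dx = alpha / (1.0 + y ** hill_n) - x
    dy = alpha / (1.0 + x ** hill_n) - y
    return (dx, dy)


def rk4_step(rhs, state, dt):
    """Advance the state by one classical fourth-order Runge-Kutta step."""
    k1 = rhs(state)
    k2 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = rhs(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = rhs(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(initial_state, t_end=50.0, dt=0.01):
    """Return the trajectory (list of (x, y) tuples) from the initial state."""
    state = tuple(initial_state)
    trajectory = [state]
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(toggle_switch_rhs, state, dt)
        trajectory.append(state)
    return trajectory


def predict_fate(initial_state):
    """Label the fate by which gene dominates at the end of the simulation."""
    x, y = simulate(initial_state)[-1]
    return "X-high" if x > y else "Y-high"


if __name__ == "__main__":
    for start in [(2.0, 1.0), (1.0, 2.0)]:
        print(start, "->", predict_fate(start))
```

Sampling such trajectories at a few time points would give the kind of snapshot data from which a neural ODE could be asked to recover the continuous dynamics. A learned model would replace `toggle_switch_rhs` with a trained network.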